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OPTIMAL TEMPLATES FOR IMAGE DATA EMBEDDING 



FIELD OF THE INVENTION 

The invention relates generally to the field of digital image 
5 processing, and in particular to embedding and extracting hidden messages in 

digital image data. This field is also referred to as data hiding, information hiding, 
watermarking and also steganography. 



BACKGROUND OF THE INVENTION 

10 A technique for embedding messages in digital image data is 

disclosed by Honsinger, et. al., (see "Method for Generating an Improved Carrier 
for use in an Image Data Embedding Application," U.S Patent 6,044,1 56 issued 
March 28, 2000). This method for embedding a hidden message includes the 
steps of forming a digitize version of the desired message; generating a random 

1 5 phase carrier; convolving the digitized version of the desired message and the 
random phase carrier to form a dispersed message; and combining the dispersed 
message with the digitized image to form an embedded hidden message image. 
Hidden messages may be embedded in photographic images recorded on film and 
paper using conventional photographic printing techniques, or they may be 

20 embedded in images produced on printers such as ink-jet and thermal dye transfer 
printers, by combining the dispersed image with a digital image processing 
techniques. 

Hidden messages are recovered from the embedded hidden 
message image by cross correlating the embedded hidden message image 

25 containing the dispersed message with a decoding carrier to recover the embedded 
dispersed message; and extracting the digital image data for the recovered 
dispersed message. 

One known techniques for extracting dispersed message when the 
image has been cropped and/or rotated employs a separate calibration signal 

30 contained in the image. Once the calibration signal was found, it is used to 

determine the dispersed message boundaries and the knowledge of the boundaries 
is used to extract the message from the image. Such a technique is described by 
Rhoads in U.S. Patent 5,850,481, issued December 15, 1998. 
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One problem with adding a calibration signal to an image is that it 
takes extra bandwidth that could be used for other purposes, such as storing more 
information in the hidden message. There is a need therefore for an improved 
method of identifying the dispersed message boundaries without the need for a 
5 separate calibration signal to aid in extraction of the dispersed message. 

Honsinger, et. al., disclosed such a means (see U.S. Patent 
Application U.S. 09/453,160) to eliminate the requirement for a calibration signal. 
The method used by Honsinger is to build a message template used to indicate the 
location of the data in a message that is to be embedded in the image. The 

10 message template is represented as a rectangular array of pixel locations. Each 
pixel location of the template is either a one or a zero. If the pixel location 
contains a one, then the pixel location carries data for the hidden message. The 
message is encoded as a stream of binary bits, referred to as the original bit 
stream. The original bit stream, is first transformed to a stream of parity values, 

15 i.e., where the stream of bits are represented as +1 or -1, with the +1 representing 
a original bit value of 1 and a -1 representing an original bit value of 0. The pixel 
locations are then ordered in raster fashion to form a linear stream of data. 
Consequently, there are two data streams, the template data stream and the 
transformed original bit stream. It is convenient to consider the two streams as 

20 lists and for each list to have a pointer, referred to as the template pointer and the 
hidden message pointer. Both pointers are initially at the first elements of their 
respective lists. A hidden message data stream is formed as follows. The element 
of the template data stream pointed to by the template pointer is either a one or a 
zero. If it is a one, then the element pointed to by the hidden message pointer is 

25 entered into the hidden message stream and the hidden message pointer is 

advanced to the next element of the transformed original message stream. Other 
wise a zero is entered into the hidden message stream. In either case, the template 
pointer is advanced to the next template element. 

The message template can have many possible configurations 

30 depending upon the number of pixels in the template and the number of bits in the 
original bit stream. If the template is an n x n pixel array and the original bit 



stream has m bits, then there are 



possible configurations. It is not surprising 
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that some configurations have better properties than others do. Consider the 
template array as torus, meaning that the top and bottom of the array and linked, 
as are the left and right sides of the array. Two desirable properties of the array 
are that the pixels with a one are isolated, and that a simple shift does not result in 
5 template pattern being replicated. The reason for the first property is to improve 
the resolution of the hidden message and the second is to assure the order of the 
hidden message is unambiguous. 

Honsinger recognized that these are attractive properties and 
developed templates heuristically that had these characteristics. However, the 

1 0 method disclosed by Honsinger used a sequential means to attain the template and 
selected pixels based upon a predetermined template pattern, such as a spiral or a 
circle. Honsinger also mentions using a simulated annealing approach, but does 
not elaborate on the method. Two limitations arise from this approach, the first is 
number of potential templates considered is extremely restricted and the 

15 maximum number of bits permitted is determined arbitrarily. 

Consequently, a need exists for overcoming the above-described 

drawbacks. 

SUMMARY OF THE INVENTION 

20 The present invention measures the performance of a hidden 

message template, determines the optimal template for a prescribed number of 
hidden message bits. Moreover, the number of hidden message bits can be 
increased until the performance of the optimal template does not meet the 
performance requirements. The result is a hidden message template that has 

25 maximum data capacity and meeting the prescribed performance requirements. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a perspective diagram of a computer system for 
implementing the present invention; 
30 Fig. 2 is a diagram illustrating the hidden message template 

generation process; 

Fig. 3 is a logic flowchart of the hidden message template 
generation process; 



Fig. 4 is a diagram illustrating tiling of the hidden message across 

the image; and 

Fig. 5 is a diagram illustrating the impact of a replicating shift. 

DETAILED DESCRIPTION OF THE INVENTION 

In the following description, a preferred embodiment of the present 
invention will be described in terms that would ordinarily be implemented as a 
software program. Those skilled in the art will readily recognize that the 
equivalent of such software may also be constructed in hardware. Because image 
manipulation algorithms and systems are well known, the present description will 
be directed in particular to algorithms and systems forming part of, or cooperating 
more directly with, the system and method in accordance with the present 
invention. Other aspects of such algorithms and systems, and hardware and/or 
software for producing and otherwise processing the image signals involved 
therewith, not specifically shown or described herein, may be selected from such 
systems, algorithms, components and elements known in the art. Given the system 
as described according to the invention in the following materials, software not 
specifically shown, suggested or described herein that is useful for 
implementation of the invention is conventional and within the ordinary skill in 
such arts. 

Still further, as used herein, the computer program may be stored in 
a computer readable storage medium, which may comprise, for example; magnetic 
storage media such as a magnetic disk (such as a hard drive or a floppy disk) or 
magnetic tape; optical storage media such as an optical disc, optical tape, or 
machine readable bar code; solid state electronic storage devices such as random 
access memory (RAM), or read only memory (ROM); or any other physical 
device or medium employed to store a computer program. 

Before describing the present invention, it facilitates understanding 
to note that the present invention is preferably utilized on any well-known 
computer system, such as a personal computer. Consequently, the computer 
system will not be discussed in detail herein. It is also instructive to note that the 
images are either directly input into the computer system (for example by a digital 
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camera) or digitized before input into the computer system (for example by 
scanning an original, such as a silver halide film). 

Referring to Fig. 1, there is illustrated a computer system 1 10 for 
implementing the present invention. Although the computer system 1 10 is shown 
5 for the purpose of illustrating a preferred embodiment, the present invention is not 
limited to the computer system 110 shown, but may be used on any electronic 
processing system such as found in home computers, kiosks, retail or wholesale 
photofinishing, or any other system for the processing of digital images. The 
computer system 110 includes a microprocessor-based unit 1 12 for receiving and 

10 processing software programs and for performing other processing functions. A 
display 1 14 is electrically connected to the microprocessor-based unit 1 12 for 
displaying user-related information associated with the software, e.g., by means of 
a graphical user interface. A keyboard 1 1 6 is also connected to the 
microprocessor-based unit 1 12 for permitting a user to input information to the 

15 software. As an alternative to using the keyboard 1 16 for input, a mouse 1 1 8 may 
be used for moving a selector 120 on the display 1 14 and for selecting an item on 
which the selector 120 overlays, as is well known in the art. 

A compact disk-read only memory (CD-ROM) 124, which 
typically includes software programs, is inserted into the microprocessor-based 

20 unit 1 12 for providing a means of inputting the software programs and other 

information to the microprocessor-based unit 112. In addition, a floppy disk 126 
may also include a software program, and is inserted into the microprocessor- 
based unit 1 12 for inputting the software program. The compact disk-read only 
memory (CD-ROM) 124 or the floppy disk 126 may alternatively be inserted into 

25 externally located disk drive unit 122 that is connected to the microprocessor- 
based unit 112. Still further, the microprocessor-based unit 112 may be 
programmed, as is well known in the art, for storing the software program 
internally. The microprocessor-based unit 112 may also have a network 
connection 1 27, such as a telephone line, to an external network, such as a local 

30 area network or the Internet. A printer 128 may also be connected to the 

microprocessor-based unit 112 for printing a hardcopy of the output from the 
computer system 110. 
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Images may also be displayed on the display 114 via a personal 
computer card (PC card) 130, such as, as it was formerly known, a PCMCIA card 
(based on the specifications of the Personal Computer Memory Card International 
Association) that contains digitized images electronically embodied in the card 
5 130. The PC card 130 is ultimately inserted into the microprocessor-based unit 
1 12 for permitting visual display of the image on the display 114. Alternatively, 
the PC card 130 can be inserted into an externally located PC card reader 132 
connected to the microprocessor-based unit 112. Images may also be input via the 
compact disk 124, the floppy disk 126, or the network connection 127. Any 
. 10 images stored in the PC card 130, the floppy disk 126 or the compact disk 124, or 
input through the network connection 127, may have been obtained from a variety 
of sources, such as a digital camera (not shown) or a scanner (not shown). Images 
may also be input directly from a digital camera 134 via a camera docking port 
136 connected to the microprocessor-based unit 1 12 or directly from the digital 

15 camera 134 via a cable connection 138 to the microprocessor-based unit 1 12 or 
via a wireless connection 140 to the microprocessor-based unit 112. 

Referring to Figure 2, a description of the message template 200 is 
provided. The message template 200 is a rectangular array of pixels, though in 
normal practice the array is preferably a square. Each element of the array is 

20 either a one or a zero. If the element is a one, then the element will carry a single 
bit of the hidden message 210. In order to attach the hidden message 210 to the 
message template 200 it is convenient to consider the message template 200 as a 
linear stream, which is readily accomplished by considering each row of the 
message template in succession, in what is commonly referred to as raster format. 

25 The hidden message 210 is also a linear stream. The number of bits in the hidden 
message is at most the number of elements in the template being set to one. The 
message template stream 200 and the hidden message 210 can be considered to be 
stacks 220 each respectively with a pointer 230 and 240. The process is outlined 
as a flowchart in Figure 3. 

30 Referring to Figs. 2 and 3, the process begins S2 with both pointers 

230 and 240 positioned to the top of their respective stacks 200. If the pointer to 
the message template stack 220a points to an element with a one S4, then the 
pointer 240 to the hidden stack message 220b points S6 to the next element to be 
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included in the hidden message within the digital image. If the hidden message 
pointer 240 points to a value of one S8, then the hidden message template stream 
220a is given S10 a value of +1, otherwise the hidden message template stream is 
given S 1 1 a value of -1 . Subsequently, the hidden message pointer 240 is 
5 advanced S12 to the next element of the stack 220b. 

If the pointer 240 to the message template stack 220b is zero S4, 
then the hidden message stream pointer 240 is not addressed. In this case, the 
hidden message template stream is given SI 3 a value of 0. The message template 
pointer 230 is then advanced S14 to the next element in the stack 220a. The 

10 hidden template generation process continues until the message template pointer 
230 reaches SI 6 the end of the stack 220a. 

Referring to Figure 4, the hidden message template 300 generated 
hereinabove is applied to a digital image by tiling the hidden message template 
300 across the image 310 and adding the hidden message template 300 to the pixel 

1 5 values of the image 3 1 0. The hidden message template 300 can be seen to repeat 
both horizontally and vertically. For this reason it is customary to consider the 
hidden message template as a torus and distances between pixels in the hidden 
message template 300 are toroidal distances. For instance, if the message 
template is an array of n x m elements and if the toroidal distance between 

20 elements (ii, j 0 and fa, j2) is given by 



Tor((i x J x )ii 2 J 2 )) =ylv n (i,J 2 ) 2 + V m (/,,./ 2 ) 2 , 



where v n (a,b)=* 



/i-|a-A| if\a-b\>^ 



Before referring to Fig. 5, it is instructive to note that the hidden 
message template and hidden message will be shown in a different format with a 
25 different numeral for clarity. Referring now to Figure 5, since the method does 
not contain an explicit calibration signal, it must be inferred from the hidden 
message template 300 illustrated in another form as hidden message template 400 
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in Fig. 5. A (8x, 5y) shift is a mapping of the message template 400 to another 
template 410 of the form 

T(6x, 5y)(i,j) = ((i+8x)mod n, (j + Sy)mod m) 

5 

If after a (8x, 5y) shift the resulting template 410 is identical to the original hidden 
message template 400, then the hidden message template 400 is undesirable and is 
referred in the following as a replicating shift. This is because the inferred 
calibration might determine the shifted version and the original hidden message 
10 420 will be convoluted into a hidden message 430. Thus, it is preferred that any 
message template 300 should be free of shifts that map the template identically 
upon itself Another property that is desirable is that the location of set of non- 
zero elements of the message template be separated as widely from each other as 
possible. 

15 An optimal message template has a pattern of ones and zeroes 

where the ones are widely dispersed, but the alignment is free of replicating shifts 
within the message template. In order to determine an optimal message template 
it is necessary to determine a measure of the goodness of the message template. A 
measure that is appropriate is one that combines both the requirement to disperse 

20 to ones within the template and the other is to reward patterns that are resistant to 
replicating shifts. Consider a message template T as a collection of k locations 
within the n x m array, e.g., 

T = {p i \p i = (a /9 fl),0£a, <«,0<# <#w,l<i<*}. 

25 

A measure of the how well the message template disperses the hidden message, 
the spatial dispersion measure, is given by: 

Disp(T) = £ min(7V>r (p. , p. )±<j< kj * i) 

30 
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It is well known that the Fourier transform of a shifted signal has the same power 
spectrum, but the signal has a phase shift. If a message template is replicated by 
shift operator then the phase terms of each frequency term of the template in the 
Fourier domain is a multiple of 2rc, hence, if the number of unique phase terms in 
5 the frequency domain increase, the likelihood of such an event diminishes. 

Mathematically, the Fourier transform of the template is a discrete set of points in 
the complex plane. If we consider those points in polar format, the set is 
P=\tj exp(i0,)|y = 0,1,2, ,«}. Let ybe a threshold value 

A={&j \3p g P such that p = r } exp(i0j ) and r } > y). It should be noted that the 

10 elements of A are unique, meaning that if two points of the templates Fourier 

transform have the same phase angle, then the phase angle is included in set A at 
most once. The second measure is the number of unique phase terms in the 
frequency domain, or equivalently the cardinality of the set A. In practice, the 
measure uses the log2 of the number of unique phase terms, as this provides a 

15 measure of dispersion of the template in the frequency domain. Thus the second 
measure is: 



FDisp(T) = log 2 (\A\) 



20 This is somewhat analogous to the notion of information-theoretic entropy. 

The overall performance measure is the product of these two 
measures. This is given by: 



£l(T) = DispiT) x FDisp(T) 

25 

A simulated annealing algorithm described in more detail below will seek to 
maximize this measure O(T). 

The simulated annealing algorithm is a direct implementation of 
the basic algorithm. An initial message template is randomly selected. This 
30 template has the prescribed size with the prescribed number of one and zero 

elements. The arrangement of the template, referred to as the configuration, will 
be modified during the simulated annealing algorithm, in order to arrive at an 
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optimal configuration. There are two configurations of interest, the current 
configuration and the alternative configuration. The current configuration is the 
template that the algorithm maintains as the preferred template. The performance 
of this template is measured and used as a baseline for an alternative 
5 configuration. The algorithm also maintains a third template, referred to as the 
best template, and this template is the template that has the best performance of all 
templates evaluated by the algorithm. A simulated annealing algorithm operates 
by randomly selecting alternative configurations, measuring the performance of 
the configuration and comparing the configuration's performance to the current 

10 configuration performance. If the performance of the alternative algorithm 
exceeds some threshold then the alternative configuration replaces the current 
configuration as well as the performance of the alternative algorithm replaces the 
current performance measure. The threshold in a simulated annealing algorithm 
allows for alternative configurations to be accepted, though their respective 

1 5 performance is inferior to the current configuration. This is done by variable 

referred to as the temperature. Initially the algorithm assumes a hot temperature, 
so the threshold is reduced accordingly. During each temperature level a series of 
possible alternative message templates are examined and measured. If the new 
message pattern meets a level of acceptance it replaces the current pattern, and if 

20 the new pattern does not met the level of acceptance it is rejected. The level of 
acceptance is based upon the temperature and the performance measure of the 
current message template. In the limit the system only accepts new configurations 
if the performance exceeds the current configuration performance. The 
performance measure described above is used for this purpose and the alternative 

25 configurations are generated by selecting one of the 1 values of the template, 
replacing it with a zero and finding a location within the template holding a 0 
value and replacing the 0 with a 1 . The placement is constrained in the locations 
also need to be sufficiently isolated form other locations with a 1 value. As the 
algorithm proceeds, the current configuration is also compared to the best 

30 template and if the performance of the current template exceeds the best template, 
the current template is then saved as the best template. 

Upon completing the algorithm, the best template is stored to a 
computer file and is subsequently used an input to the data hiding process 



The invention has been described with reference to a preferred 
embodiment. However, it will be appreciated that variations and modifications 
can be effected by a person of ordinary skill in the art without departing from the 
scope of the invention. 
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PARTS LIST 

110 Computer System 

112 Microprocessor-based Unit 

114 Display 

116 Keyboard 

118 Mouse 

1 20 Selector on Display 

122 Externally Located Disk Drive Unit 

124 Compact Disk - read Only Memory (CD-ROM) 

126 Floppy Disk 

1 27 Network Connection 

128 Printer 

130 Personal Computer Card (PC card) 

1 32 Externally Located PC Card Reader 

134 Digital Camera 

136 Camera Docking Port 

138 Cable Connection 

140 Wireless Connection 

200 message template steam 

210 hidden message 

220 stacks 

230 pointer 

240 pointer 

300 hidden message template 

310 image 

400 illustrative hidden message template 

410 shifted hidden message template 

420 hidden message 

430 convoluted hidden message 



